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• Premise of the study: Novel nuclear exon-primed intron-crossing (EPIC) markers were developed to increase phylogenetic 
resolution among recently diverged lineages in the frankincense and myrrh family, Burseraceae, using Citrus, Arabidopsis, and 
Oryza genome resources. 

• Methods and Results: Primer pairs for 48 nuclear introns were developed using the genome resource IntrEST and were screened 
using species of Commiphora and other Burseraceae taxa. Four putative intron regions (RPT6A, BXL2, mtATP Synthase D, and 
Rah6) sequenced successfully for multiple taxa and recovered phylogenies consistent with those of existing studies. In some 
cases, these regions yielded informative sequence variation on par with that of the nuclear ribosomal DNA internal transcribed 
spacer. 

• Conclusions: The combination of freely available genome resources and our design criteria have uncovered four single-copy 
nuclear intron regions that are useful for phylogenetic reconstruction of Burseraceae taxa. Because our EPIC primers also 
amplify Arabidopsis, we recommend their trial in other rosid and eudicot lineages. 
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Resolving phylogenetic relationships among closely related 
angiosperm species is frequently hindered due to limited vari- 
ation in currently available markers (Li et al., 2008; Zimmer 
and Wen, 2012). This challenge is no less problematic in the 
myrrh genus, Commiphora Jacq. (Burseraceae), where com- 
plete, species-level resolution has not been achieved despite 
the use of multiple gene regions (Weeks and Simpson, 2007). 
We describe the development and evaluation of four novel, 
exon-primed intron-crossing (EPIC) markers for Burseraceae 
(Sapindales) using a repository of putative, intron-flanking 
nuclear unigenes from 43 plant taxa and two complete refer- 
ence genomes (IntrEST; Hut and Doyle, 2012). Markers were 
evaluated for their phylogenetic utility at the species level us- 
ing a recently radiated lineage of Commiphora and a generic- 
level sampling in Burseraceae. Sequence variation from these 
novel markers was compared to existing nuclear markers and 
shows promise for resolving relationships at both shallow and 
deeper phylogenetic scales. 
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METHODS AND RESULTS 

Marker development — Development of EPIC markers for Burseraceae in- 
volved unigene data sets of Citrus Clementina hort. ex Tanaka and C. sinensis 
(L.) Osbeck (Rutaceae; Sapindales) and two reference genomes available in 
IntrEST, Oryza sativa L. and Arabidopsis thaliana (L.) Heynh. We developed 
1 2 primer pairs for putative introns from each of four predicted amplicon size 
categories (200-bp increments between 400-1200 bp), resulting in 48 total 
primer pairs. For each size category, six primer pairs were developed from a 
percent-identity criterion of either 80-89.9% or 90-100% between the unigene 
and the corresponding reference. We predicted that the lower percent-identity 
criterion (80-89.9%) might yield more informative variation among closely 
related species. Half of the primer pairs were generated using C. Clementina and 
the other half from C. sinensis unigenes. Primer sequences were a consensus 
between unigene and the corresponding reference genome. Primers were pref- 
erentially designed using A. thaliana. Primers were designed between 18-30 bp, 
within 50 bp of putative intron splice regions in the reference genome, hav- 
ing a melting temperature (T m ) between 51-74°C, without predicted dimers, 
and a 35-60% G-C content. Primer characteristics were evaluated using Oligo- 
Evaluator (Sigma-Aldrich, St. Louis, Missouri, USA). Exceptions for T m and 
%GC were made for 1 8 primers where it was not possible to meet all necessary 
criteria (Appendix 1). Each primer pair was tested by its ability to amplify a 
single PCR product from two species of Madagascan Commiphora (C. lamii 
H. Perrier and C. ankaranensis (J.-F. Leroy) Cheek & Rakot.) and a positive 
control (A. thaliana) and to sequence cleanly. 

Taxonomic sampling and molecular methods — Markers that passed all of 
the above criteria were evaluated using 14 species of Burseraceae (Appendix 2), 
including eight Commiphora ingroup species and six outgroup species from 
closely (Bursera Jacq. ex L., Aucoumea Pierre) and distantly (Boswellia Roxb. 
ex Colebr., Protium Burm. f., and Beiselia Forman) related genera, respectively 
(Weeks et al., 2005; Thulin et al., 2008). All ingroup taxa are Madagascan, and 
seven correspond to one of two species-rich clades in Madagascar. We sampled 
densely from one clade to test phylogenetic utility at shallow scales. Whole 
genomic DNA was extracted from specimens using the FastPrep FastDNA Spin 
Kit (BiolOl Systems, La Jolla, California, USA). All markers were amplified in 
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25-pL PCR reactions including: 0.5 pL forward and reverse primers (5 pM), 
0.5 pL spermidine (4 mM), 2 pL total DNA, and 5 pL GoTaq Green Master 
Mix (Promega Corporation, Madison, Wisconsin, USA). A ramp-up PCR ther- 
mocycler protocol followed a 4-min presoak at 94°C with 35 cycles of 30 s at 
94°C (denaturation), 1 min at 48-56°C (annealing), and 50 s at 72°C (exten- 
sion), followed by a 4-min postsoak at 72°C PCR products were purified prior 
to sequencing reactions using ExoSAP (USB Corporation, Cleveland, Ohio, 
USA) and sequenced by Macrogen (Rockville, Maryland, USA) using an ABI 
3730XL Analyzer with BigDye Terminator version 3.1 (Applied Biosystems, 
Foster City, California, USA). Sequencing reactions (10 pL) for both directions 
included 40 ng/pL template. Products were assembled and edited using Se- 
quencher 4.7 (Gene Codes Corporation, Ann Arbor, Michigan, USA). 

Phylogenetic analyses — Multiple sequence alignment (MSA) was per- 
formed using MUSCLE version 3.7 (Edgar, 2004). Gap regions in the MSA 
were treated as missing data. Markers were evaluated using maximum parsi- 
mony (MP) and Bayesian inference (BI). MP analyses were conducted using 
PAUP* 4.0b 10 (Swofford, 2002) with a two-step protocol modified from 
Plunkett et al. (2005). Branch support for internal nodes was inferred by 
bootstrapping 1000 replicates in PAUP*. BI analyses were performed using 
MrBayes version 3.1.2 (Ronquist and Huelsenbeck, 2003). Two runs were 
performed for each data set using the best-fitting model as determined by 
jModelTest (Posada, 2008) consisting of four chains each for 10 million gen- 
erations sampled every 1000 generations; 10% sampling was discarded as 
burn-in for each run. MSA and BI analyses were performed in the CIPRES 
Science Gateway (Miller et al., 2010). 

Marker evaluation — Fifteen of the 48 EPIC primer pairs (31%) amplified 
at least one species, and four pairs (8%; 10F-10R, 16F-16R, 39F-39R, and 43F- 
43R) produced amplicons that sequenced cleanly for multiple taxa. Provisional 
marker names are provided based on gene ontology categories from reference 
taxa (Table 1). When searched in BLAST, sequences of putative intron regions 
for RPT6A, BXL2, and Rab6 (Appendix 2) matched gene ontology categories 
predicted for the Arabidopsis and Oryza references in IntrEST. Sequence prod- 
ucts for mtATP Synthase D (Appendix 2) did not BLAST to predicted gene 
ontology categories. Sequences produced by this study have been deposited 
into GenBank (Appendix 2). Sequence alignment files are deposited in the 
Dryad Digital Repository (http://doi. org/10. 5061/dryad.382p0; Gostel and 
Weeks, 2014). Phylogenetic statistics of new EPIC markers are presented in 
comparison (Table 1) with those from nuclear markers developed for previous 
phylogenetic studies of Burseraceae (ETS: Weeks and Simpson, 2007; ITS: 
Gostel and Weeks, unpublished). Phylogenetic analysis of EPIC markers devel- 
oped in this study recovered well-resolved phylogenies consistent with those 
from previous studies (Fig. 1). The concatenated set of all four EPIC markers 
resulted in improved phylogenetic resolution compared to previously devel- 
oped markers (Fig. 1). 

Critical assessment of primer design criteria — Each of the 15 primer pairs 
that amplified at least one species spanned the range of melting temperatures 
(5 1-74°C), differed from their pair by less than 10°C in T m , and were developed 
from both Citrus unigene data sets and both reference genomes. More than half 
(9/15) of these markers were designed using 80-89.9% identity criteria, yet 
only two (16F-16R and 43F-43R) sequenced cleanly for multiple taxa. Two of 
the six primer pairs developed from the 90-100% identity criterion (10F-10R 
and 39F-39R) sequenced cleanly for multiple taxa and yielded the most infor- 
mative variation among Commiphora species. These results do not support 
predictions that lower percent identity would provide better shallow-scale 
phylogenetic resolution, which suggests mutation rates between exon and intron 
regions are independent. 



CONCLUSIONS 

The EPIC markers developed in this study may also be useful 
for phylogenetic reconstruction in other angiosperm taxa. Most 
primer pairs amplified A. thaliana (Brassicales), and they may 
work in other rosid or eudicot taxa. Of the four markers, RPT6A 
is most promising for further evaluation. This ca. 400-bp region 
sequenced cleanly for all Burseraceae taxa and yielded a 
percentage of phylogenetically informative characters on par 
with ITS. Our study demonstrates how genomic resources from 



^3 



E— 

03 

Z 
Q 

a 
S 



y 
03 



^3 









4 








•o 


~z 


DC 

d 


OS 




itatistic 


4,0.7 


d 


DC 


36, 0. 


6, 0.4 




d 


d 


OS 


d 


u 


d 


•■6 




d 


en 

<N 

r- 


t3 

'» 






r> 

OS 


as 
r- 




d 


d 


d 




d 




oo 


■* 




d 


as 


1 




a*. 








Fa 




a\ 


© 


DC 




in 


3 


o 


DC 


as 

DC 
















o 

a\ 
o 

oz 
d 



o o 

C5 O 

ft Eh 

U ft 

O O 

U sS 

A c2 

ft Eh 

C5 >H 

>H Eh 

ft O 

U Eh 

>h Eh 

Eh U 

ft Eh 

O >H 

U ft 

Pi Eh 

<; a 

U Eh 

O O 

Eh O 

O ft 



ft 

u 

Eh 

O >H 

<! O 
ID O 

a ^ 

U Eh 
Eh U 
<! >H 

cj u 
o o 

ft Eh 

3, ft 

u u 



ft 

o u 

cs o 

<; ft 

a ei 

Eh Eh 

U ft 
Eh 

2 >H 

U ft 

A ft 

u o 

ft a 

Eh K 

>h u 

O CJ 



<; 

cj 

ID 

I 

<: 

O ft 

ft cj 

ft cj 

O Eh 

ft ft 

ft EH 

O ft 

ft CJ 

Eh ft 

<; CJ 

o cj 

ft cj 

cj u 

ft » 

ft o 

CJ Eh 

Eh ft 

Eh CJ 

CJ CJ 

O Eh 



u 
ft 
cj 
cj 



cj 

EH 

< 

u 
u 
ft 
u 

cj cj cj 

ft a ft 

Eh Eh CJ 

Eh CJ ID 

CJ ft < cj 

CJ Eh ft CJ 

Eh O CJ ft 

Eh ft CJ CJ 

ID O ft ft 

Eh Eh CJ CJ 

cj < < cj 

Eh Eh Eh O 

CJ ft Eh ft 

CJ CJ EH CJ 

Eh U ft U 
<<<UU 

Eh ft Eh ft 

ID O ft O 

U ft Eh ft 

U U Eh U 

Eh ft U B 

Eh U U U 



£ 
O 

U 



3 



oq 



o r__, *5 i— i 



H 

03 



http://www.bioone.org/loi/apps 



2 of 6 



Applications in Plant Sciences 2014 2(4): 1300098 
doi:10.3732/apps,1300098 



Gostel and Weeks — Novel EPIC markers for the Burseraceae 



RPT6A 
Intron 




C. sp. B 1011504 

C. aprevalii 1012004 

C. falcata 1012603 

C. grandifolia 1011301 

C. lamii 1012602 

C. pervilleana 1011101 

C. sp. A 1010901 

C. ankaranensis 1011102 

B. saru/f/7an/7 99VIII1701 

B. simaruba s.n. 

/I. klaineana 466 

P. copa/3136 

S. sacra 01X0803 

B. mexicana s.n. 



BXL2 
Intron 




C. sp. B 1011504 

C. fa/cafa 1012603 

C. pervilleana 1011101 

C. sp. A 1010901 

C. grandifolia 1011301 

C. /am/71012602 

C. ankaranensis 1011102 

B. simaruba s.n. 

B. saru/tr/an//99VIII1701 

A. klaineana 466 
P. copa/3136 

B. mexicana s.n. 



mtATPSy- e%/<| 
nthase D 7%- 
Intron 

100/1 




C. aprevalii 1012004 

C. grandifolia 1011301 

C. sp. B 1011504 

C. fa/ca/a 1012603 

C. /am/71012602 

C. ankaranensis 1011102 

B. simaruba s.n. 

P. copa/3136 

B. mexicana s.n. 



Kab6 
Intron 




C. aprevalii 1012004 

C. fa/cate 1012603 

C. grandifolia 1011301 

C. /am// 1012602 

C. sp. B 1011504 

C. pervilleana 1011101 

C. sp. A 1010901 

C. ankaranensis 1011102 

P. copa/3136 

B. sacra 01X0803 



nrDNA 
(ETS & ITS) 



99 / .95 
100/. 95 











</< 

















</< 




C. sp. B 
C. falcata 
C. aprevalii 

C. lamii 
C. grandifolia 
C. pervilleana 

C. sp. A 
C. ankaranensis 
Bursera sarukhanii 
Bursera simaruba 
Aucoumea klaineana 
Protium copal 
Boswellia sacra 
Beiselia mexicana 



Concatenated 
new markers 




Fig. 1 . Phylogeny of 14 representative taxa in the Burseraceae sampled in this study. Values above branches correspond to maximum parsimony bootstrap 
support values, followed by Bayesian posterior probabilities. "Concatenated new markers" refers to a concatenated data set of all four new markers. Refer 
to Table 1 for individual marker statistics. 
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model organisms can be leveraged to advance the phylogenetic 
systematics of nonmodel organisms. 
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Appendix 1. List of all 48 primer pairs developed in this study and their characteristics. 



Locus" 


Primer sequence (5'-3') 


T m 


%GC 


Nbases 


Reference taxon b 


EST C 


%ID d 


Citrus sp. e 


IF' 


GAT CWGARAT C GCMGARGAAGT Y C GC 


60.7 


46.2 


26 


Arabidopsis 


AT3G12290.1 


85 


sinensis 


1R ; 


TCAGCRCAMGCYTTYCTCTTCRTRYTC 


74 


37.1 


27 


Arabidopsis 


AT3G 12290.1 


85 


sinensis 


2 pt 


TGCAARTCTCTYKTTGCYGG 


65.3 


40 


20 


Arabidopsis 


AT4G0 1100.1 


83 


sinensis 


2R : 


TCCARWGGWGCAACAGCA 


61.1 


50 


18 


Arabidopsis 


AT4G0 1100.1 


83 


sinensis 


3F 


TACACRTATGCWAGRTGCAC 


63.6 


40 


20 


Arabidopsis 


AT1G48410.2 


87 


sinensis 


3R 


GCTGCWAGRTGKGCATARTATGC 


65.9 


43.5 


23 


Arabidopsis 


AT1G48410.2 


87 


sinensis 


4Ft 


ATTCTYGTCYTGTCCGSWAGAGA 


61 


39.2 


23 


Arabidopsis 


AT4G2 1960.1 


83 


Clementina 


4Rt 


CCATCTCTYCTTCCTGTCTT 


51.2 


45 


20 


Arabidopsis 


AT4G2 1960.1 


83 


Clementina 


5F 


GCTGGAYTMACSSTYGAYCATCC 


64.3 


39.2 


23 


Arabidopsis 


AT1G22410.1 


84 


Clementina 


5R 


TCATGRGAWGTCCAGAASTC 


67.7 


40 


20 


Arabidopsis 


AT1G22410.1 


84 


Clementina 


6F 


ATGCTKTTTGGTGCWRTTGGGAC 


74 


43.5 


23 


Arabidopsis 


AT5G27 150.1 


82 


Clementina 


6R 


GAAC T GWRT WAC AC C T AGWGAT AT GA 


57.3 


34.7 


26 


Arabidopsis 


AT5G27 150.1 


82 


Clementina 


7F 


GCAKCACCRAAGATGYTGAAC 


62.4 


42.9 


21 


Arabidopsis 


AT1G34130.1 


91 


sinensis 


7R 


CAGCTCWCCRAAYCKRTARTAKSATA 


60 


27 


26 


Arabidopsis 


AT 1G34 130.1 


91 


sinensis 


8F 


GCTGTTGCSYTGAARCAGGC 


64.8 


50 


20 


Arabidopsis 


AT4G 13930.1 


92 


sinensis 


8R 


CTTGTTTYGCRTAGRCCTTGAA 


60 


36.4 


22 


Arabidopsis 


AT4G 13930.1 


92 


sinensis 


Q F ! 


ATATCARGGTGCY T AC AAG A 


57.1 


35 


20 


Arabidopsis 


AT5G50850.1 


90 


sinensis 


9Rt 


CTCAGGRCCATATTTCTCCAA 


58.1 


42.9 


21 


Arabidopsis 


AT5G50850.1 


90 


sinensis 


10Ft 


CTCCARCACATYCAY GARC T C C AGC 


55.6 


48 


25 


Arabidopsis 


AT5G 19990.1 


93 


Clementina 


10R f 


AGCTGTAAYTCTTCTYTRAGCATCC 


61.7 


36 


25 


Arabidopsis 


AT5G 19990.1 


93 


Clementina 


1 IF 


AC GMC T Y GA Y AT GGAT T AC GT Y GA 


54.8 


37.5 


24 


Arabidopsis 


AT1H04690.1 


90 


Clementina 


11R* 


TTCATCGCCCTCACMGTCTCYTC 


77 


52.2 


23 


Arabidopsis 


AT1H04690.1 


90 


Clementina 


12F+ 


CGTYGAYGTKMTYTATTGYCACA 


57.7 


30.5 


23 


Arabidopsis 


AT1G04690.1 


90 


Clementina 


12R f 


TCATCGCCCTCACMGTCTC 


67.1 


57.9 


19 


Arabidopsis 


AT1G04690.1 


90 


Clementina 


13Ft 


ACAAGCCWCCTGAAGATGC 


62 


52.7 


19 


Arabidopsis 


AT4G37510.1 


87 


sinensis 


13R f 


GTCCAAGTTCRATRTTYCTWGCTTC 


54.5 


36 


25 


Arabidopsis 


AT4G37510.1 


87 


sinensis 


14F+ 


T T A Y T C AAT G T T C AAC AG A 


57.6 


26.4 


19 


Arabidopsis 


AT4G02580.1 


86 


sinensis 


14Rt 


CACGKAYCATRCAAGGTGTTGTGCC 


62.7 


48 


25 


Arabidopsis 


AT4G02580.1 


86 


sinensis 


15F 


GCTYTWCCTTCRGAKACTGGTC 


57.3 


45.5 


22 


Arabidopsis 


AT5G37850.2 


88 


sinensis 


15R 


GTACWGARTKGATTGGATCCAC 


58.5 


41 


22 


Arabidopsis 


AT5G37850.2 


88 


sinensis 


16F+ 


CTTGTGGGAAKCCATCGGAC 


66.3 


55 


20 


Arabidopsis 


AT1G02640.1 


84 


Clementina 


16R f 


CGTTGTACATKGCYCTKGCYTCA 


64.9 


43.5 


23 


Arabidopsis 


AT1G02640.1 


84 


Clementina 


17F+ 


CAAGARGCKKTTTGTCGCC 


65.8 


47.4 


19 


Arabidopsis 


AT1G62050.1 


83 


Clementina 


17Rt 


CCAAGCKRARCGGTGGTGA 


57.6 


52.7 


19 


Arabidopsis 


AT1G62050.1 


83 


Clementina 


18F 


TTGAGTTRTCTCSWGAAGC 


57.2 


36.9 


19 


Arabidopsis 


AT3G07 160.1 


87 


Clementina 


18R 


GCARTGCAATRTCARCAGC 


55.5 


42.2 


19 


Arabidopsis 


AT3G07 160.1 


87 


Clementina 


19F 


TGAYCTYCTTGATGCRTTGGAC 


62.8 


41 


22 


Arabidopsis 


AT5G1 1170.2 


95 


sinensis 


19R 


GCATATWGARGGRAAATTGCATTC 


55 


33.4 


24 


Arabidopsis 


AT5G1 1170.2 


95 


sinensis 


20F 


AGTTTRCTCTCTGTTGATCCRAC 


51.9 


39.2 


23 


Arabidopsis 


AT2G25660.1 


93 


sinensis 


20R 


GCTGMACTTCAACTTCYGTWCCA 


56.8 


43.5 


23 


Arabidopsis 


AT2G25660.1 


93 


sinensis 


21F 


GGAAYTWAGGGAAGAATGC 


57.6 


42.2 


19 


Arabidopsis 


AT4G32 180.3 


90 


sinensis 


21R 


GCATCAASAAAYTGGAAYTC 


67.8 


30 


20 


Arabidopsis 


AT4G32 180.3 


90 


sinensis 


22F 


GATGGCTCGTGAAAGTGCTC 


65 


55 


20 


Arabidopsis 


AT2G27600.1 


91 


Clementina 


22R 


CCACGYKKAC C AC A Y AARGAAT C 


56.6 


39.2 


23 


Arabidopsis 


AT2G27600.1 


91 


Clementina 


23F 


GATGCRTTGGACTTYAATCAA 


58.6 


33.4 


21 


Arabidopsis 


AT5G1 1170.2 


95 


Clementina 


23R 


GACATKCCAGARTGGATGCATA 


57.7 


41 


22 


Arabidopsis 


AT5G1 1170.2 


95 


Clementina 


24F 


AGCTTYTAGCSGACAATGC 


59.8 


42.2 


19 


Arabidopsis 


AT5G26680.2 


90 


Clementina 


24R 


GTAAATGCTCATGCTAGCATCAA 


62.9 


39.2 


23 


Arabidopsis 


AT5G26680.2 


90 


Clementina 


25F 


GACAAGGTTCTCATGGARAGC 


54.4 


47.7 


21 


Arabidopsis 


AT5G54 160.1 


80 


sinensis 


25R 


C C AC C WT C AAGMA Y T G C AT C 


69.9 


45 


20 


Arabidopsis 


AT5G54 160.1 


so 


sinensis 


26F 


TGGACACTTCGAGGRCTTTG 


60.4 


50 


20 


Arabidopsis 


AT1G67060.1 


86 


sinensis 


26R 


ACCCATATKACRGCGAGGA 


56.1 


47.4 


19 


Arabidopsis 


AT1G67060.1 


86 


sinensis 


27F 


C T G T AA Y C ARG AC AAC CGCGTYAC 


57.8 


45.9 


24 


Arabidopsis 


AT5G2 1090.1 


87 


sinensis 


27R 


AGRTTT GAAT TWC C CAAAT C 


54.8 


30 


20 


Arabidopsis 


AT5G2 1090.1 


87 


sinensis 


28F 


TGGCTKGGWCARAAYCAGRTTC 


54.8 


41 


22 


Arabidopsis 


AT5G 11480.1 


86 


Clementina 


28R 


CATCWAYTTGTCGCATTTKGTGAA 


66.7 


33.4 


24 


Arabidopsis 


AT5G 11480.1 


86 


Clementina 


29F 


CTGGTTGTTGCTGATGA 


56.3 


47.1 


17 


Arabidopsis 


AT3G54300.2 


88 


Clementina 


29R 


TCCTTGACYCGTTCGAGA 


59.7 


50 


18 


Arabidopsis 


AT3G54300.2 


88 


Clementina 


30F 


TCGTYATWGCCTCCCTCGACGTTC 


64.7 


54.2 


24 


Arabidopsis 


AT4G 16600.1 


83 


Clementina 


30R 


CACYACYTTWGCTCCATCTTCYTSTTC 


71.8 


37.1 


27 


Arabidopsis 


AT4G 16600.1 


83 


Clementina 


31F 


GATGCTTTTGAATTCATTGTA 


56.8 


28.6 


21 


Arabidopsis 


ATMG00285.1 


94 


sinensis 


31R 


CATGGCAATTAAATCATRAGCCGA 


62.4 


37.5 


24 


Arabidopsis 


ATMG00285.1 


94 


sinensis 


32F 


GATCAGGTYCGTGGTGTMATGGA 


55.9 


47.9 


23 


Oryza 


13101.m04144 


94 


sinensis 


32R 


CATTTGGCTYTCYCCATA 


56.1 


38.9 


18 


Oryza 


13101.m04144 


94 


sinensis 


33Ft 


GTCGGCAAYCTCGAYCCCCA 


71.1 


60 


20 


Oryza 


13110.m02788 


93 


sinensis 


33R f 


T C C C AWAGT ARC T C C T CMGWAA 


51.4 


41 


22 


Oryza 


13110.m02788 


93 


sinensis 


34F* 


T C T C C AG AAT AC C G C AGGC AGC AAC 


74.5 


56 


25 


Arabidopsis 


AT 1G03 150.1 


97 


Clementina 


34R 


CACAAAGTARGCYTTATCAA 


55.4 


30 


20 


Arabidopsis 


AT 1G03 150.1 


97 


Clementina 


35F 


GTTGGSTGGTAYCAYTCACA 


68.5 


40 


20 


Oryza 


13104.m05825 


91 


Clementina 
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Appendix 1. Continued. 



Locus" 


Primer sequence (5'— 3') 


7 


%GC 


Nbases 


Reference taxon b 


EST C 


%ID d 


Citrus sp. e 


35R 


CAATRCCY GAWARC C AGC AT C 


51.5 


42.9 


21 


Oryza 


13104.m05825 


91 


Clementina 


36F 


ACCGGTGTCAAGAGRYTSTA 


62.4 


40 


20 


Oryza 


13111.m02571 


93 


Clementina 


36R 


GTGACAGAGTCATTGACATTGA 


60.2 


41 


22 


Oryza 


13111.m02571 


93 


Clementina 


37F 


TACAAGCTTWYKGGCATCAAG 


58.3 


38.1 


21 


Arabidopsis 


AT2G38 110.1 


85 


sinensis 


37R 


ACCACAGGRTCKARAACRGTGC 


60.1 


45.5 


22 


Arabidopsis 


AT2G38 110.1 


85 


sinensis 


38F 


AGGGTYAAGAATCCAGAATGG 


55.4 


42.9 


21 


Arabidopsis 


AT5G 13430.1 


82 


sinensis 


38R 


GCATTWGGYAARGGRATGCACC 


54.4 


45.5 


22 


Arabidopsis 


AT5G 13430.1 


82 


sinensis 


39F+ 


TCCTYCCYTACRCMTCTGAGC 


65.3 


47.7 


21 


Arabidopsis 


AT5G47030.1 


81 


sinensis 


39R f 


GTTGATGCKGGAAYKATRACCA 


57.1 


36.4 


22 


Arabidopsis 


AT5G47030.1 


81 


sinensis 


40F 


GAATTYGTGATCTCYAARKTSGATG 


53.6 


28 


25 


Arabidopsis 


AT5G 11 770.1 


88 


Clementina 


40R 


CATRGGCCAGATSGAKCCGSKACGA 


64.9 


48 


25 


Arabidopsis 


AT5G11770.1 


88 


Clementina 


41F 


GAAGAYTCKGTYAGGGTYAAGAA 


54.2 


34.8 


23 


Arabidopsis 


AT5G 13430.1 


82 


Clementina 


41R 


CAGCATTWGGYAARGGRATGCACC 


58 


45.9 


24 


Arabidopsis 


AT5G 13430.1 


82 


Clementina 


42F 


GCTGAAATYGCTKCTGGAAGTGA 


57.7 


43.5 


23 


Arabidopsis 


AT5G47840.1 


81 


Clementina 


42R 


TCAGGKACCAAYTGTCCTTTCTCCA 


71.5 


44 


25 


Arabidopsis 


AT5G47840.1 


81 


Clementina 


43F+ 


GAAC AAAAC T GAT C T T GT KGAC AA 


58.5 


33.4 


24 


Oryza 


13107.m03172 


93 


sinensis 


43R f 


CCAGCYTTRGCACTRGTYTCAA 


64.9 


41 


22 


Oryza 


13107.m03172 


93 


sinensis 


44Ft 


C C T T C AAC AGAT AC AAC AAC AT GC A 


66.7 


40 


25 


Arabidopsis 


AT3G57670.1 


96 


sinensis 


44R f 


TCCATGYCCCCACATATGCA 


64.7 


50 


20 


Arabidopsis 


AT3G57670.1 


96 


sinensis 


45F 


GC GAGARAART CAGCTGAYCCA 


59.5 


45.5 


22 


Oryza 


13102.m04682 


95 


sinensis 


45R 


GCAGTCCAYTTRAATATGTTKGAATC 


59.4 


30.8 


26 


Oryza 


13102.m04682 


95 


sinensis 


46F 


AGGCAAGTMTCMATAGAGGA 


55 


40 


20 


Oryza 


13107.m03172 


92 


Clementina 


46R 


CCAGCYTTRGCACTRGTYTCAA 


64.9 


41 


22 


Oryza 


13107.m03172 


92 


Clementina 


47F 


TGAGACAGGGTGTWCTTGGYATYAA 


52.7 


40 


25 


Oryza 


13103.m04131 


92 


Clementina 


47R 


GGATKGTTACRAGATCMGGYAGAG 


53.9 


41.7 


24 


Oryza 


13103.m04131 


92 


Clementina 


48F 


C AGC T GAY C C AGA Y AT Y CART T A 


54 


34.8 


23 


Oryza 


13102.m04682 


95 


Clementina 


48R 


GCAGTCCAYTTRAATATGTTKGAATC 


59.4 


30.8 


26 


Oryza 


13102.m04682 


95 


Clementina 



Note: %GC = percent G-C content; Nbases = number of nucleotides that comprise the primer; T m = melting temperature. 

a Primer names with asterisks (*) indicate primers that did not meet the necessary melting temperature criteria; f indicates primer pairs that were able to successfully 
amplify in at least one specimen. 

b The model organism reference in IntrEST from which the primer was developed. 
c The expressed sequence tag record number that was used to develop the marker. 
d Percent shared identity between the reference taxon and Citrus species. 
e Species of Citrus (C. sinensis or C. Clementina) that was used to develop the primer. 



Appendix 2. List of species, vouchers, and geographic origin with GenBank accession numbers for all putative gene regions. GPS coordinates were not included with 
some of the herbarium vouchers, which is reflected when no coordinate is given. 



Beiselia mexicana Forman; Pell s.n. (TEX). Mexico. ETS: FJ233929, ITS: 
JF919030, RPT6A Intron: KF906106, BXL2 Intron: KF906094, mtATP Syn- 
thase D Intron: KF906084. 

Protium copal (Schltdl. & Cham.) Engl.; Killeen 3136 (MO). Mexico. 15°15'S, 
067°00'W. ETS: AY964612, ITS: KF906073, RPT6A Intron: KF906108, 
BXL2 Intron: KF906095, mtATP Synthase D Intron: KF906085, Rab6 
Intron: KF906120. 

Aucoumea klaineana Pierre; Walters 466 (MO). Gabon. 00°07'12"S, 
011°42'57"E. ETS: KF906082, RPT6A Intron: KF906105, BXL2 Intron: 
KF906093. 

Boswellia sacra Flueck; Weeks 01-X-08-03 (TEX). N.E. Africa. ETS: 
AF445957, ITS: AF455880, RPT6A Intron: KF906107, Rab6 Intron: 
KF906119. 

Bursera sarukhanii Guevara & Rzed.; Weeks 00-VIII-18-06 (TEX). Mexico. 
ETS: AY315051, ITS: AF445820, RPT6A Intron: KF906109. 

B. simaruba (L.) Sarg.; Goldman s.n. (TEX). Florida, U.S.A. ETS: GQ378038, 
ITS: GQ378104, RPT6A Intron: KF906110, BXL2 Intron: KF906097, 
mtATP Synthase D Intron: KF906086. 

Commiphora ankaranensis (J.-F. Leroy) Cheek & Rakot.; Weeks 10-1-11-02 
(GMUF). Ankarana, Madagascar. 12°18'53"S, 49°20'18"E. ETS: KF035010, 
rfS: KF906081, RPT6A Intron: KF906118, BXL2 Intron: KF906104, 
mtATP Synthase D Intron: KF906092, Rab6 Intron: KF906128. 

Commiphora aprevalii (Baill.) Guillaumin; Weeks 10-1-20-04 (GMUF). Toliara, 
Madagascar. 22°57'16"S, 44°20'39"E. ETS: KF034992, ITS: KF906075, 



RPT6A Intron: KF906112, mtATP Synthase D Intron: KF906088, Rab6 
Intron: KF906122. 

Commiphora falcata Capuron; Weeks 10-1-26-03 (GMUF). Toliara, 
Madagascar. 23°01'29"S, 43°36'60"E. ETS: KF034994, ITS: KF906076, 
RPT6A Intron: KF906113, BXL2 Intron: KF906099, mtATP Synthase D 
Intron: KF906089, Rab6 Intron: KF906123. 

Commiphora grandifolia Engl.; Weeks 10-1-13-01 (GMUF). Ankarana, 
Madagascar. 12°34'49"S, 49°27'31"E. ETS: KF034996, ITS: KF906077, 
RPT6A Intron: KF906114, BXL2 Intron: KF906100, mtATP Synthase D 
Intron: KF906090, Rab6 Intron: KF906124. 

Commiphora lamii H. Perrier; Weeks 10-1-26-02 (GMUF). Toliara, Madagascar. 
23°01'29"S, 43°36'60"E. ETS: KF034998, ITS: KF906078, RPT6A 
Intron: KF906115, BXL2 Intron: KF906101, mtATP Synthase D Intron: 
KF906091, Rab6 Intron: KF906125. 

Commiphora pervilleana Engl.; Weeks 10-1-1 1-01 (GMUF). Ankarana, 
Madagascar. 12°18'53"S, 49°20'18"E. ETS: KF035005, ITS: KF906079, 
RPT6A Intron: KF906116, BXL2 Intron: KF906102, P43: KF906126. 

Commiphora sp. A. Weeks 10-1-09-01 (GMUF). Ankarana, Madagascar. 
12°14'11"S, 49°21'18'E. ETS: KF035009, ITS: KF906080, RPT6A 
Intron: KF9061 17, BXL2 Intron: KF906103, Rab6 Intron: KF906127. 

Commiphora sp. B. Weeks 10-1-15-04 (GMUF). Ankarana, Madagascar. 
12°34'49"S, 49°27'31"E. ETS: KF0906087, ITS: KF906074, RPT6A 
Intron: KF906111, BXL2 Intron: KF906098, mtATP Synthase D Intron: 
KF906087, Rab6 Intron: KF906121. 
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